Sensitivity of Counting Queries
نویسندگان
چکیده
In the context of statistical databases, the release of accurate statistical information about the collected data often puts at risk the privacy of the individual contributors. The goal of differential privacy is to maximise the utility of a query while protecting the individual records in the database. A natural way to achieve differential privacy is to add statistical noise to the result of the query. In this context, a mechanism for releasing statistical information is thus a trade-off between utility and privacy. In order to balance these two “conflicting” requirements, privacy preserving mechanisms calibrate the added noise to the so-called sensitivity of the query, and thus a precise estimate of the sensitivity of the query is necessary to determine the amplitude of the noise to be added. In this paper, we initiate a systematic study of sensitivity of counting queries over relational databases. We first observe that the sensitivity of a Relational Algebra query with counting is not computable in general, and that while the sensitivity of Conjunctive Queries with counting is computable, it becomes unbounded as soon as the query includes a join. We then consider restricted classes of databases (databases with constraints), and study the problem of computing the sensitivity of a query given such constraints. We are able to establish bounds on the sensitivity of counting conjunctive queries over constrained databases. The kind of constraints studied here are: functional dependencies and cardinality dependencies. The latter is a natural generalisation of functional dependencies that allows us to provide tight bounds on the sensitivity of counting conjunctive queries. 1998 ACM Subject Classification F.4.1 Mathematical Logic: Model theory; H.2.3: Languages: query languages
منابع مشابه
Learning and Verifying Graphs Using Queries with a Focus on Edge Counting
We consider the problem of learning and verifying hidden graphs and their properties given query access to the graphs. We analyze various queries (edge detection, edge counting, shortest path), but we focus mainly on edge counting queries. We give an algorithm for learning graph partitions using O(n log n) edge counting queries. We introduce a problem that has not been considered: verifying gra...
متن کاملTitle : On the Learnability of Counting Functions
We examine the learnability of concepts based on counting functions. A counting function is a generalization of a parity function in which the weighted sum of n inputs is tested for equivalence to some value k modulo N . The concepts we study therefore generalize many commonly studied boolean functions. We first show that disjunctions of counting functions (DOCFs) with modulus N are learnable b...
متن کاملData Utility in Differential Privacy via Microaggregation-based k-Anonymity”
In addition to the general-purpose SSE-based utility evaluation conducted and discussed in the body of the article, in this appendix we provide evaluation results for a specific data use, namely counting queries. The reason of focusing on this data use is that many related works on differentially-private data publishing aim at preserving the utility for counting queries over protected data [12–...
متن کاملRange Counting with Distinct Constraints
In this paper we consider a special case of orthogonal point counting queries, called queries with distinct constraints. A d-dimensional orthogonal query range Q = [b1, b2]× [b3, b4]× . . .× [b2d−1, b2d] is a range with r distinct constraints if there are r distinct values among b1, b2, . . ., b2d. We describe a data structure that supports orthogonal range counting queries with r distinct cons...
متن کاملAnswering Counting Aggregate Queries over Ontologies of the DL-Lite Family
One of the main applications of description logics is the ontology-based data access model, which requires algorithms for query answering over ontologies. In fact, some description logics, like those in the DL-Lite family, are designed so that simple queries, such as conjunctive queries, are efficiently computable. In this paper we study counting aggregate queries over ontologies, i.e. queries ...
متن کامل